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ABSTRACT 

We report the discovery of 29 promising (and 59 total) new lens candidates from the 
CFHT Legacy Survey (CFHTLS) based on about 11 million classifications performed 
by citizen scientists as part of the first Space Warps lens search. The goal of the blind 
lens search was to identify lens candidates missed by robots (the RingFinder on 
galaxy scales and ArcFinder on group/cluster scales) which had been previously 
used to mine the CFHTLS for lenses. We compare some properties of the samples 
detected by these algorithms to the Space Warps sample and find them to be broadly 
similar. The image separation distribution calculated from the Space Warps sample 
shows that previous constraints on the average density profile of lens galaxies are 
robust. Space Warps recovers about 65% of known lenses, while the new candidates 
show a richer variety compared to those found by the two robots. This detection 
rate could be increased to 80% by only using classifications performed by expert 
volunteers (albeit at the cost of a lower purity), indicating that the training and 
performance calibration of the citizen scientists is very important for the success of 
Space Warps. In this work we present the SIMCT pipeline, used for generating in situ a 
sample of realistic simulated lensed images. This training sample, along with the false 
positives identified during the search, has a legacy value for testing future lens finding 
algorithms. We make the pipeline and the training set publicly available. 

Key words: gravitational leasing: strong - methods: statistical - methods: citizen 
science 


1 INTRODUCTION 

The last few decades have seen a rise in the discoveries 
of strong gravitational lenses owing to the plethora of in¬ 
teresting applications lenses have in astrophysics and cos- 

* anupreeta.moreOipmu.jp 


mology. Strong lenses are routinely used to probe the dark 
matter distribution from galaxy (e.g. Koopmans et al. 2006; 
Barnabe et al. 2009; Leier et al. 2011; Sonnenfeld et al. 2015) 
to group and cluster scales (e.g. Limousin et al. 2008; Zitrin 
et al. 2011; Oguri et al. 2012; More et al. 2012; Newman 
et al. 2013), to study distant young galaxies by using the 
lensing magnification as a natural telescope (e.g. Zitrin & 
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Broadhurst 2009; Zheng et al. 2012; Whitaker et al. 2014), 
to test the cosmological model by constraining cosmologi¬ 
cal parameters such as the Hubble constant and the dark 
energy equation of state (e.g. Suyu & Halkola 2010; Col¬ 
lett et al. 2012; Collett Sz Auger 2014; Sereno & Paraficz 
2014), and many more. Strong lenses are rare, because a 
foreground massive object needs to be sufficiently aligned 
with a distant background source to produce multiple im¬ 
ages. Nevertheless, systematic lens searches have led to the 
discovery of over 500 lenses to date.^ 

The search for gravitational lenses is a needle-in-a- 
haystack problem. Several automated lens finding algo¬ 
rithms have been developed so far (e.g. Lenzen et al. 2004; 
Alard 2006; Seidel & Bartelmann 2007; More et al. 2012; 
Brault & Gavazzi 2014; Gavazzi et al. 2014), but they can 
not simultaneously capture the myriad types of lenses that 
are known to exist. For example, the lensed images of back¬ 
ground galaxies show variety in their surface brightness dis¬ 
tributions, colours, light profiles, shapes, structures and an¬ 
gular image separations. Moreover, many lensed images ap¬ 
pear similar to features found commonly in galaxies (such 
as spiral arms) or to artefacts in astronomical images (scat¬ 
tered light around stars). Almost all lens finding algorithms 
find it difficult to distinguish these from the real lenses and 
thus suffer from a high rate of false positive detections. To 
mitigate this problem, algorithms are often restricted to de¬ 
tect a very narrow class of lens systems. However, even after 
such restrictions, robotic lens searches have to always rely 
on visual screening to produce a sample of plausible lens 
candidates. 

Recognising patterns is one of the strengths of the hu¬ 
man brain. Humans are also capable of dealing with multi¬ 
tiered complex web of questions before arriving at a conclu¬ 
sion, a process which may not be always possible to auto¬ 
mate. The algorithm by which our brains process a task is 
extremely malleable, self-learning and self-evolving. There¬ 
fore it has a huge potential for the discovery of exotic ob¬ 
jects which do not quite fit a set criteria, but are still very 
likely to be objects of interest. The lens finding algorithms 
are not yet advanced enough to produce better performance 
than visual classifications. Consequently, as we enter the era 
of large area imaging surveys spanning thousands of square 
degrees, the participation of a large community of volunteers 
to help with the visual identification of lenses would be very 
beneficial for the lensing community. Now seems the perfect 
time to investigate the potential of citizen science. 

Galaxy Zoo, one of the most successful citizen science 
projects in astronomy, addressed the problem of how to clas¬ 
sify large numbers of galaxies by their morphology (Lintott 
et al. 2008). From these early results to several new unex¬ 
pected and interesting discoveries, such as that of green pea 
galaxies (Cardamone et al. 2009; Jaskot & Oey 2013) and 
Hanny’s Voorwerp (Lintott et al. 2009; Keel et al. 2012), 
Galaxy Zoo has been able to start to realize the potential 
of citizen scientists. Since then, both astronomy and non¬ 
astronomy projects have been launched under the citizen sci¬ 
ence web portal Zooniverse (http://zooniverse.org). The 
task of finding gravitational lenses is significantly challeng¬ 
ing, given that the lens systems show such complexity and 


^ http://admin.masterlens.org/index.php 


that they are rare. To add to the challenge, not many citi¬ 
zen scientists are expected to be aware of the phenomenon of 
gravitational lensing, and the resulting characteristic image 
configurations. With these significant challenges at hand, we 
designed the Space Warps project to enable the discovery of 
lenses through citizen science (learning from previous expe¬ 
rience in serendipitous identification of lens candidates in 
Galaxy Zoo). In a companion paper (Marshall et al. 2015, 
hereafter Paper I), we describe the design of Space Warps 
and how the entire system functions as a discovery service. 
In this paper (Paper H), we describe our first lens search us¬ 
ing data from the Ganada-France-Hawaii Telescope Legacy 
Survey (GFHTLS^). In Kiing et al. (2015), we describe the 
design of a collaborative mass modelling tool that can be 
used by citizen scientists. 

This paper is organised as follows. In Section 2, we 
introduce the GFHTLS imaging data and the previously 
published lens samples from the GFHTLS. We generated 
a training sample, consisting of simulated lenses, duds and 
impostors, in order to aid the Space Warps volunteers in the 
process of finding lenses. We give details of this training 
sample in Section 3 and Section 3.4. In Section 4, we briefly 
describe how the classifications of images from the volun¬ 
teers are turned into a catalog of plausible candidates (for 
further details, see Paper I). In Section 5, we present the 
new lens candidates from Space Warps and compare it to 
the lens samples produced by past robotic searches of the 
GFHTLS. Next, we discuss what kind of lenses are detected 
or missed by the algorithms and Space Warps in Section 6. 
Our conclusions are given in Section 7. 

2 DATA 

2.1 The CFHT Legacy Survey 

The GFHTLS is a photometric survey in five optical 
bands g r'i z'^ carried out with the wide-held imager 
MegaPrime which has a I deg^ held-of-view and a pixel size 
of O.I86^^(Gwyn 2012). The GFHTLS WIDE covers a total 
non-overlapping area of 160 deg^ on the sky and consists 
of four helds WI, W2, W3 and W4. The held WI has the 
largest sky coverage of 63.7 deg^. The helds W2 and W4 
have similar sky coverages of 22.6 deg^ and 23.3 deg^, re¬ 
spectively^. The held W3 has a sky coverage of 44.2 deg^ 
and is more than twice as large as W2 and W4. 

The GFHTLS imaging is very homogeneous and has 
good image quality. Most of the lensed arcs are much 
brighter in the g band, so deep imaging in this band is desir¬ 
able. The limiting magnitude is 25.47 for the g band which 
goes the deepest among all of the hve bands. The mean see¬ 
ing in the g band is 0.78^b The zero point to convert hux to 
AB magnitude for all bands is 30. These characteristics make 
GFHTLS ideal to do visual inspection for hnding lenses. We 
use the stacked images from the hnal T0007 release taken 
from the Terapix website^ for this work. 

We note that the GFHTLS is a niche survey with a 

^ http://www.cfht.hawaii.edu/Science/CFHTLS/ 

^ These numbers are estimated from 

http://terapix.iap.fr/cplt/T0007/doc/T0007-doc.pdf 
^ http://terapix.iap.fr/cplt/T0007/doc/T0007-doc.pdf 
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unique combination of wide imaging with deep sensitiv¬ 
ity. It is a precursor to the ongoing wide imaging surveys 
such as the Dark Energy Survey (DES), Kilo Degree Sur¬ 
vey (KiDS) and the Hyper Suprime-Cam (HSC) survey and 
other planned future surveys such as the Large Synoptic 
Survey Telescope (LSST) survey. The search for lenses with 
Space Warps in the CEHTLS is an important step to learn 
lessons and prepare for lens searches in these larger imaging 
surveys. 

2.2 Previously published lens samples from the 
CFHTLS 

The CEHTLS has been searched for lenses using various 
lens finding methods and algorithms. Here, we give a brief 
summary of previously published lens samples in the chrono¬ 
logical order. 

Erom the early release of the CEHTLS (T0002) covering 
28 deg^, Cabanac et al. (2007) used an arc finding algorithm 
(Alard 2006) to find arcs in galaxies, groups and clusters. 
They found about 40 lens candidates with quality grades 
from low to high. 

In the thesis dissertation of Thanjavur (2009), 9 promis¬ 
ing and 2 low probability candidates were reported as having 
been discovered serendipitously. These detections were made 
during the visual inspection of the CEHTLS images as part 
of data reduction procedures for the Weak Leasing survey 
(Benjamin et al. 2007). 

Sygnet et al. (2010) carried out a search for edge-on 
disk galaxy lenses in the CEHTLS WIDE. They identified 
galaxies, using S Extractor, which had 18 < z < 21 and 
inclination angle < 25 °. After applying few more selection 
criteria and visual inspection, they found about 3 promising 
and a total of 18 lens candidates. 

The ArcEinder (More et al. 2012) was used for finding 
blue arc-like features in the entire CEHTLS imaging with¬ 
out any pre-selection on the type of the lensing object. This 
ArcEinder, an improved version of the algorithm by Alard 
(2006), measures the second order moments of the flux dis¬ 
tribution in pixels within small regions to estimate the di¬ 
rection and extent of local elongation of features. Pixels with 
high values of elongation are connected to form an arc can¬ 
didate. Einally, a set of thresholds on arc properties such 
as the area, length, width, curvature and surface brightness 
are used to select arc-like candidates. The search was carried 
out in the ^-band which is the most efficient wavelength to 
find typical lensed features. This sample, called SARCS, has 
55 promising and a total of 127 lens candidates which are 
selected from both CEHTLS WIDE and DEEP fields. The 
SARCS sample consists of some galaxy-scale candidates and 
mostly groups/cluster scale lens candidates. This is because 
more massive systems produce arcs or lensed images with 
large image separation from the lensing galaxy which are 
easier to detect compared to the galaxy-scales. In the ab¬ 
sence of a large systematically followed up verified sample 
of candidates, we choose the most promising 26 systems as 
our bona fide lens sample from the CEHTLS WIDE. The to¬ 
tal number of lens candidates in the CEHTLS WIDE alone 
is 108. 

In Elyiv et al. (2013), the authors visually inspected 
a sample of 5500 optical counterparts of X-ray point-like 
sources identified in the XMM-LSS imaging of the CEHTLS 


W1 field. The goal was to find instances of lensed quasars. 
Their sample consists of a total of 18 candidates, of which 
3 candidates were found to be promising. 

Gavazzi et al. (2014) used their RingEinder code 
to find compact rings or arcs around centres of isolated 
and massive early-type galaxies. RingEinder subtracts 
the point spread function (PSE)-matched z-band images 
from the ^-band images, and looks for excess flux in the 
bluer ^-band. An object detector measures the proper¬ 
ties of these residual blue features, and candidates which 
meet the length-width ratio and tangential alignment cri¬ 
teria are then visually inspected to form the final sample. 
Gavazzi et al. (2014) pre-selected ~638,000 targets as ei¬ 
ther photometrically-classified early type galaxies, or ob¬ 
jects selected to have red centres and blue outer parts, from 
the T0006 CEHTLS data release catalogs. A total of 14370 
galaxies were found to show detectable blue residuals, and 
2524 were visually inspected, having passed the automatic 
feature selection process. This led to a total of 330 lens can¬ 
didates out of which 42 were deemed good quality (q_flag 
= 3) and 288 medium quality (q_flag = 2) candidates. In 
addition to the main well-defined sample of Gavazzi et al. 
(2014), a further 71 candidates were reported to have been 
detected by earlier versions of the RingEinder, or from the 
CEHTLS DEEP. Erom the main sample of “RingEinder 
candidates,” the SL2S team found, during their follow-up 
campaign, 33 confirmed lenses (Sonnenfeld et al. 2013b,a). 

The work by Maturi et al. (2014) used the arc find¬ 
ing code of Seidel & Bartelmann (2007) and colour proper¬ 
ties of typical arcs to optimize arc detection. This new ap¬ 
proach was tested on the CEHTLS-Archive-Research Sur¬ 
vey (CARS, Erben et al. 2009) which covers an area of 
37 deg^ only, and this entire image set was also visually 
inspected by the authors to estimate the completeness and 
purity of their robotic search. They found 29 candidates with 
the robotic search alone and 41 candidates through pure vi¬ 
sual inspection—some of which were known from previous 
searches. Most of these candidates are medium-low proba- 
bility^ 

The RingEinder and the ArgEinder searches are the 
only searches that make use of a lens finding algorithm 
and that have been run on the entire CEHTLS imaging 
dataset. Thus, we considered these to be our reference sam¬ 
ple of known lenses from robotic searches. Eor the pur¬ 
poses of transparency and to help with the training, the vol¬ 
unteers participating in Space Warps-CEHTLS lens search 
were made aware of these two known lens samples. Images 
containing the systems from the RingEinder and the Ar¬ 
gEinder samples were labelled as “known lens candidates” 
in the Space Warps discussion forum. Talk,® where volun¬ 
teers have the opportunity to discuss their findings with fel¬ 
low volunteers and the science team. In this paper, we refer 
to the sample of 330 RingEinder and 108 ArgEinder lens 
candidates as the sample of “known lens candidates” and the 
sample of confirmed (or most promising) 33 RingEinder 
and 26 ArgEinder as the sample of “known lenses”. Note 
that the “known lens” sample is a subset of the “known lens 
candidates” sample. Also, note that the lens candidates from 


^ http: //www. ita. uni-heidelberg. de/~maturi/Public/arcs 
® http://talk.spacewarps.org/ 
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the other papers listed above were not included in our refer¬ 
ence “known” sample and were not labelled as such in Talk. 
However, we did exclude these candidates when compiling 
the list of new Space Warps lens candidates, as described in 
Section 5.2. 


2.3 Image presentation in Space Warps 

In order to perform a blind lens search over the entire 
CFHTLS WIDE, we present the volunteers with cutouts of 
images selected from the survey region. We briefly describe 
the image presentation here for completeness; more infor¬ 
mation can be found in Paper I. We use the r and z-band 
imaging from CFHTLS which are most useful for visual iden¬ 
tification of lenses. We made colour composite images using 
the publicly-available code, HumVI^ following the prescrip¬ 
tion of Lupton et al. (2004). The colour scales were chosen 
to maximize the contrast between faint extended objects 
and bright early type galaxies. These parameters were then 
fixed during the production of all the tiles, in order to allow 
straightforward comparison between one image and another, 
and for intuition to be built up about the appearance of stars 
and galaxies across the survey. 

We extracted contiguous cutouts of size 82'' (440 pix¬ 
els), including overlapping region of 10" (54 pixels) between 
the neighbouring cutouts. This resulted in a catalog of some 
430,000 cutouts for the entire CFHTLS WIDE region. The 
size of the individual cutout was determined by optimising 
factors such as the typical angular scales of gravitational 
lenses, the number of objects seen in a single cutout and 
the total number of image cutouts in the survey. If a lens 
candidate happens to be too close to the edge of a cutout, 
then the overlap between neighbouring cutouts allows a vol¬ 
unteer to get a clearer view of the same candidate in at least 
one of the cutouts. We note that since the images are shown 
randomly, a volunteer may not necessarily come across the 
neighbouring cutout unless they classify a large number of 
images. This is not a problem since our user base is ex¬ 
tremely large and we receive multiple classifications for the 
same cutout. 


3 TRAINING SAMPLE 

The simulated lenses are important to train citizen scientists 
who may be new to the task of finding lenses, but they are 
also crucial for analysing the classihcations performed by 
the citizen scientists (more details can be found in Paper I, 
but see Section 4 below for a brief summary). In this section, 
we describe the framework used for generating the simulated 
lens sample, give details of the sample itself along with some 
of its known limitations and also describe the sample of duds 
and impostors. 


^ The open source colour image composi¬ 
tion code used in this work is available from 
http://github.com/drphilmarshall/HumVI 


3.1 Methodology to simulate lenses 

For the purpose of generating simulated lens systems, we 
divide them into two main categories a) galaxy scale lenses 
b) and group or cluster-scale lenses. We further subdivide 
galaxy scale lenses based on the type of the background 
sources, namely galaxies and quasars. We do not simulate 
group-scale quasar lenses as they are expected to be even 
more rare. We now describe our procedure to generate these 
different types of lens systems. 


3.1.1 Galaxy-scale lenses 

We begin by considering all elliptical galaxies at z < 1 in our 
parent CFHTLS catalog (Gavazzi et al. 2014) as potential 
lens candidates for the simulated sample. To avoid using a 
known lens galaxy for our simulation purpose, we exclude 
all those galaxies whose positions match with the leasing 
galaxies from the SARCS samples within 2 arcsec®. 

For each galaxy, the average number of source objects 
(either quasars or galaxies) above a minimum luminosity 
Lmin in the background that may get leased can be calcu¬ 
lated as 

Asrc — / nsrc(^ L Yn.in •> Zs ) O'vi Zs ~ d^s (l) 

Jz, dz, 

where 

noo 

n-src (> Lmin5 Zs) — / ^{L',Zs)AL'. (2) 

-^min 

Here, 4>(L',Zs) denotes the source luminosity function 
per unit comoving volume, criens denotes the angular lens 
cross-section, which depends upon the lens redshift (z^), 
source redshift (^s), the lens velocity dispersion Oy as well 
as the projected axis ratio of the lens ellipticity, q. 

In order to calculate the leasing cross-section, we first 
calculate the luminosity of each potential leasing galaxy us¬ 
ing the photometric redshifts {zi) from the parent galaxy 
catalog. Next, we use the L — o scaling relation from the 
bright sample of (Parker et al. 2005) given by 

/ r \ 1/3 

cr„ = 142 (—j Kms“\ (3) 

This sets the velocity dispersion of the halo hosting the 
galaxy, which will be later used in the model. We assume 
that the knee of the luminosity function of galaxies, L* 
evolves such that there is a decline of 1.5 magnitudes be¬ 
tween z — 1 to z = 0 (Faber et al. 2007).^ 

We adopt a singular isothermal ellipsoid (SIE) model 
for each of our galaxies (Kormann et al. 1994), such that 


® Due to inaccuracies and uncertainties in measurements of the 
centres of some of the lens candidates, some simulated lensed 
images were superposed on the galaxies of known lens candidates. 
This issue was overcome by presenting the same CFHTLS images 
with and without the simulated lenses to the volunteers. 

® We anchor our L* evolution at low redshifts using the de¬ 
termination of L* in the r-band by Blanton et al. (2001). To 
maintain consistency in magnitude systems, we have converted 
the CFHT MegaCAM magnitudes to SDSS magnitudes and k- 
corrected them to z = 0.1. 
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Figure 1. Examples of the three types of simulated lenses. 


the convergence is given by 


K,{x,y) = 


1 


( 4 ) 


2 {0l + q^0lf 

Here, b is called the Einstein radius, and its dependence on 
the velocity dispersion of the SIE is given by 


b = 47r ( 


D, 


( 5 ) 


The SIE model results in a caustic and a pseudo-caustic on 
the source plane: which demarcate the regions of different 
image multiplicities. We make use of the parametric solu¬ 
tions, r{0), for the caustics in such a model from Keeton 
et al. (2000b) where 0 is the polar angle. We take the maxi¬ 
mum of the radial and tangential caustic at every polar angle 
in order to obtain the area of the leasing cross-section, aiens, 
for every galaxy, 

r2 y-27r 

CTlens = ^ J r‘^{0)d0. (6) 

We also add external shear at the centre of the potential 
leasing galaxy drawn randomly from a set range (see Table 1. 
The shear is expected to affect the lens cross-section for a 
small number of cases when the shear strength is high in 
addition to high lens ellipticity or the PA of the shear is 
almost orthogonal to that of the lens ellipticity. However, 
the effect of shear on the lens cross-section is expected to 
be small for most of the cases and is ignored in the current 
implementation of SIMCT. 

The luminosity functions of the background galaxies 
and quasars are determined as follows. We use the results 
of Faure et al. (2009) to specify the luminosity function of 
galaxies where the redshift distribution of sources is given 
by 


Ps = 


iSzjexpi 

r(3//3)2:g(miim) 


( 7 ) 


where = 3/2 and zo(miim) = 0.13 mum — 2.2 and the 
source counts as a function of the limiting magnitude are 
given by 

_no dm_ 

yyiQ2a(mi-m) _|_ ]^Q26(mi-m) ’ 

with parameters a = 0.30, b = 0.56, mi = 20 and no = 
3 X 10^ deg -‘^. 

For quasars, we assume the luminosity function pre¬ 
scription of Oguri & Marshall (2010) and adopt k-corrections 
by Richards et al. (2006). 


/ V 

-c 


The luminosity function is expressed as 


d$ _ 

dM “ 10O.4(a + l)(M^bs-M.) 10O-4(^ + l)(^abs-M.) 


( 9 ) 


where the normalization, 0* = 5.34 x 10~^h^ Mpc“^ and 
break magnitude, M* = —20.90 + 51og/i — 2.b\ogf{z). The 
redshift dependent factor in is given by 


/(^) 


(\/e^ + ' 


( 10 ) 


We adopt the best-fit values C = 2.98, = 4.05, z* = 1.60 

(Oguri h Marshall 2010). For the faint end slope, we use 
/3 = —1.45 whereas for the bright end slope, we use ol — 
—3.31 when Zg < 3 and a = —2.58 at higher redshifts, as 
prescribed by Oguri & Marshall (2010). 

With the cross-section, and the luminosity functions 
specified, we calculate the expected number of sources be¬ 
hind a candidate leasing galaxy using Equation 1. We 
need to generate a large number of simulated lenses (larger 
than the number of real galaxy lenses we expect to find 
in CFHTLS) in order to have a reasonably large and di¬ 
verse training sample for thousands of Space Warps volun¬ 
teers. Therefore, we artificially boost the average number of 
sources by a factor (see Table 1), which increases the oc¬ 
currence of lensing. We draw a Poisson deviate, Nsvc with 
a mean equal to the boosted average number of sources. If 
iVsrc is greater than zero, then this galaxy is flagged as a 
potential lensing galaxy. 

Next, we determine properties of the background 
sources for every lens system. We follow similar procedures 
for both background galaxies and quasars. We draw source 
redshifts and luminosities from the aforementioned distribu¬ 
tions. We note that the sources are being drawn from a much 
fainter magnitude range compared to the limiting magnitude 
of the CFHTLS imaging and thus, the magnification bias^° 
is naturally taken into account. The source positions with 
respect to the lens are drawn randomly from an area inside 
the caustic. When populating the sources within the caus¬ 
tics, the finite size of the background galaxies is expected 
to affect the lens cross-sections to some extent. As this fac¬ 
tor is not critical for the purpose of our training sample, for 


In a flux-limited sample from a survey, sources fainter than 
the flux limit end up in the sample owing to the magniflcation by 
lensing which is known as the magniflcation bias. This affects the 
source luminosity function and needs to be accounted for when 
comparing the true and observed luminosity functions. 
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simplicity we assume the background galaxies to be point 
like when computing cross-sections. We perform ray-tracing 
for all of the iVsrc sources using the publicly available code 
GRAVLENS (Keeton et al. 2000a) and choose sources that 
satisfy our selection criteria given below. We determine the 
fluxes of the lensed images and the total magnification of 
each of the lensed sources. We draw a source randomly for 
which the flux of the second brightest lensed image and the 
total magnihcation of all lensed images meet the thresholds 
given in Table 1. 

Since we want to produce realistic looking lens sys¬ 
tems, we simulate lenses in each of the five CFHTLS filters. 
The colours of the background galaxies are drawn randomly 
from the photometric CFHTLenS catalog (Hildebrandt et al. 
2012; Erben et al. 2013). Similarly, we use a quasar catalog 
from the SDSS Data Release 9 (Paris et al. 2012) from which 
colours are drawn to simulate quasar lenses. Next, we assume 
a Gaussian profile^^ for the galaxies. The ellipticity and the 
position angle (PA) are drawn randomly from within the 
range given in Table 1. The effective radius of the galaxy is 
estimated from the luminosity—size relation (Bernardi et al. 
2003, with a redshift scaling, to account for size evolution) 
given by 

. 2/3 

Kpc (11) 

{l + Zsf 

where Lr — Tg/lO . On the other hand, quasars are as¬ 

sumed to be point sources and the PSF, with which quasars 
are convolved, is assumed to have a Gaussian profile. The 
full-width-at-half-maximum of the PSF is equated to that 
of the mean seeing for every filter. The mean seeing values 
are taken from Table 4 of the official Terapix T0007 release 
explanatory document 

Once all the parameters are determined for the lens 
and source models, we once again use GRAVLENS to generate 
simulated lensed images. After accounting for the shot noise 
in the lensed images and convolving them with the seeing 
in each of the hlters, the simulated image is added to the 
real GFHTLS image centred on the galaxy chosen to act as a 
lens. Note that we ensure that the lensed galaxies and lensed 
quasars are not superposed on the same “leasing” galaxy. 
Similarly, these “leasing” galaxies at the galaxy scales are 
ensured to be distinct from those chosen for the group scales. 
The framework for the group-scale is described below. 


3.1.2 Group-scale lenses 

At group or cluster-scales, the mass distribution is more 
complex. The convergence in the inner regions, which are 
typically responsible for the multiple lensed images, arises 
from not only the brightest group galaxy (BGG) at the cen¬ 
tre, but also from the dark matter component and the satel¬ 
lite galaxies (Oguri et al. 2005; Oguri 2006). We generate 
a basic group catalog based on the magnitudes and pho¬ 
tometric redshifts available for the GFHTLS. We select all 
galaxies with lO^^'^M© as plausible BGGs. We select the 


This was due to an oversight. We intended to use either an 
exponential or de Vaucouleurs’ profile that will be adopted for 
future implementations of SIMCT. 

http://terapix.iap.fr/cplt/T0007/doc/T0007-doc.pdf 
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Figure 2. Einstein radius and total magnification distributions 
for all types of lenses. The solid (blue) curves show the theoretical 
prediction assuming an SIS model at galaxy-scales and a total 
(NFW+Hernquist) model at group scales taken from (More et al. 
2012 ). 


member galaxies such that their photometric redshifts are 
within 5z — 0.01 of the BGG and within an aperture of 
250 Kpc. If another BGG is found within the aperture, then 
the fainter BGG is removed from our list of BGGs. 

We assume a constant mass—to—light ratio of 3 x 
0.7 h to convert the BGG luminosity to a stel¬ 

lar mass estimate. The stellar mass—halo mass relation 
(Behroozi et al. 2013), including random scatter, is then 
used to calculate the halo mass for the lens. We adopt an 
NEW (Navarro et al. 1997) density profile for the underlying 
dark matter halo. Given the halo mass, other key parame¬ 
ters such as the scale radius (r^) and the density at the 
scale radius {ps) can be determined for an NEW profile. In 
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Figure 3. Distributions of properties of the “lensing” galaxies of the simulated sample compared to the known lens sample SARCS. 


Table 1. Thresholds used in the selection of the simulated lenses. 


Name 

Gal-Gal (Grp-Gal) 

Gal-Qua 


min 

max 

min 

max 

Source Redshift 

1.0 

4.0 

1.0 

5.9 

Source Flux 

21.0 

25.5 

21.0 

25.5 

Source ellipticity 

0.1 

0.6 

- 

- 

Source PA 

0 

180 

- 

- 

Lens Redshift 

- 

0.9 

- 

0.9 

Lens shear strength 

0.001 (-) 

0.02 (-) 

0.001 

0.02 

Lens shear PA 

O(-) 

180 (-) 

0 

180 

Einstein radius (arcsec) 

1.2 (2) 

5 (-) 

1.2 

5 

boost factor 

=100 (40) 


=1200 


Image F1 ux2b 

>23 


>23 


Image Fluxtot 

<19 


<20 



a) 0 - corresponds to quantities used for Grp-Gal scale lenses, 
if they are different from Gal-Gal. b) 2 B “ the second brightest 
lensed image, c) tot “ total flux integrated over all of the lensed 
images, d) All fluxes are in AB mag. PA is in degrees measured 
East of North. 

addition, we adopt an SIE model for the BGG and mem¬ 
bers whenever the ellipticities are available from the galaxy 
catalog (else we use an isothermal sphere, SIS). 

We calculate the luminosities and velocity dispersions 
for the BGG and each of the member galaxies following the 
same prescription as in Section 3.1.1. To calculate the aver¬ 
age number of sources that get lensed by such a system, we 
need to calculate the lensing cross-section for each of these 
potential lensing groups. The complexity in the lens models 
makes it analytically intractable to calculate the size of the 
caustics^^. Therefore, we generate the caustics numerically 
using GRAVLENS and then determine the area covered by the 
caustics. We consider only galaxies as our background source 

The lens mass distribution determines size and shape of the 
caustics. Any source located within the caustics will form multiple 
lensed images which is the criteria for strong lensing. To further 
understand caustics, see e.g., Schneider et al. (1992). 


population since group or cluster-scale quasar lenses are ex¬ 
pected to be extremely rare in the GFHTLS. Following the 
same procedure as described in Section 3.1.1, we calculate 
the number of galaxies expected to lie behind every poten¬ 
tial lensing group (see Equation 1). As before, for each back¬ 
ground galaxy within the lens cross-section, a redshift and 
an z-band magnitude is determined by drawing galaxies ran¬ 
domly from the respective distributions (see Equations 7-8). 

All those groups that are found to have no background 
galaxies within the cross-sectional area are rejected and the 
rest are included as potential lenses. As mentioned earlier, 
we artificially boost the total number of sources behind ev¬ 
ery lens but ensure that the statistical properties such as the 
profile of the image separation distribution are not affected 
(see Figure 2). We follow the same procedure and apply 
thresholds to determine properties of the lensed galaxies for 
every lens as described for galaxy-galaxy lenses in the previ¬ 
ous section. The thresholds are same as those used for galaxy 
lenses (see Table 1) and are reported within “()”5 differ¬ 
ent for group scales. The simulated lensed images are then 
added to the real GFHTLS images with the BGGs as the 
centre by following exactly the same procedure as described 
in the previous section. 

3.2 Simulated lens sample and catalog description 

In this section, we describe some of the properties of our sim¬ 
ulated sample for each of the three types of lens samples. We 
have made an attempt to generate as realistic a lens sam¬ 
ple as possible within the requirements of the Space Warps 
project. The statistical properties of the lens sample are ex¬ 
pected to be similar to real lens samples. 

In Figure 2, we show the Einstein radius (Re) distribu¬ 
tion for the galaxy-scale and group-scale simulated lenses. 
For comparison, we give the expected distributions (blue 
solid curves) for an SIS like density profile at galaxy-scales 
and an NFW+Hernquist profile at group scales. The theo¬ 
retical curves are taken from More et al. (2012) wherein the 
models are explained in detail. We note that the model we 
adopt at the group scale also includes SIS or SIE compo¬ 
nents for the group members unlike the theoretical predic- 
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tion. The theoretical curves have arbitrary normalizations. 
We also show the distribution of the total magnification for 
all three samples. 

Next, we consider the redshift, magnitude and elliptic- 
ity distributions of the “lensing” galaxies from the simulated 
sample as shown in Figure 3. For reference, we also show 
SARCS lenses from More et al. (2012), with arbitrary nor¬ 
malizations. We find that the properties of the foreground 
lenses in the simulated and the real lens samples are broadly 
similar. 

We produce catalogs with lens and source properties 
for each of the three types of simulated lenses. The catalogs 
typically have lens position, redshift, magnitudes, Einstein 
radius, ellipticity (whenever available) and shear (for galaxy- 
scale lenses only). For the background sources, we provide 
the offset from the lens centre, redshift, magnitudes, total 
magnification, number of lensed images. Additionally, when 
possible, ellipticity and effective radius of the background 
galaxies have also been provided. These catalogs are avail¬ 
able from https://github.eom/cLnupreeta27/SIMCT and 
the simulated lens image sample is available from the au¬ 
thors on request. 

3.3 Limitations of the simulated lens sample 

The simulated lens sample, although realistic, is not perfect, 
due to the simplicity of the lensing models and our limited 
understanding of the uncertainties in the model parameters. 
Comments from citizen scientists were very helpful in order 
to identify some of these failures, which make up roughly 
5% of the simulated sample.Here, we describe some of 
the cases or aspects in which the simulations were known to 
have failed to look realistic. 

The parameters required by the models’ various scaling 
relations primarily depend on the photometry of the galax¬ 
ies, groups and quasars detected in the survey. For galaxy- 
scale lenses, the fainter or higher redshift galaxies, chosen 
to act as lenses, tend to have poor photometric redshift 
measurements. Consequently these galaxies were occasion¬ 
ally assigned the wrong luminosity and velocity dispersion 
estimates, resulting in simulated lenses which look implau¬ 
sible or unrealistic. For example, the lensed images for some 
of the failed simulations have larger image separation than 
what one would expect from the luminosity and/or size of 
the galaxy. We roughly expect mass to follow light, so more 
massive galaxies typically look brighter and/or bigger. 

At group scales, the magnitudes and photometric red- 
shifts were used when defining the group membership. 
Therefore, errors in redshift estimates occasionally gener¬ 
ated galaxy groups having member galaxies with unrealisti¬ 
cally dissimilar properties. In some cases, low redshift spiral 
galaxies were incorrectly assigned high redshift. Spiral galax¬ 
ies are typically less massive and low redshift spiral galaxies 
are unlikely to act as gravitational lenses. Hence, some such 
instances did not appear convincing, as the lensed images 
again did not have the expected configurations or separa¬ 
tions. 

We also use a single component to describe the light 

This estimate is based on the number of ^^simfail tags from 
TALK, the discussion forum. 


distribution of the background galaxies. This is clearly not 
the most accurate description for galaxies, especially for the 
irregular star-forming galaxies which comprise a significant 
fraction of the lensed galaxy population. Star-forming galax¬ 
ies have complex structures such as star forming knots, spiral 
arms, bars and disks. The simulated lensed images do not 
display these features. This is not problematic for most of 
the images taken from ground based telescopes such as the 
CFHT but sometimes the profiles of the lensed arcs can ap¬ 
pear very symmetric (along the length or width of the arc) 
and featureless, especially, if the images are very bright. 

3.4 Duds and Impostors 

Citizen scientists need training not only to identify gravita¬ 
tional lenses, but also to reject images which either contain 
no lenses, or contain objects which could be mistaken for 
lenses. Hence, in addition to the simulated lenses, we added 
a sample of “duds” and “impostors” to the training sam¬ 
ple. Duds are images which have been visually inspected by 
experts and confirmed to contain no lenses. Impostors are 
systems which have lens like features but are not lenses in 
reality, for example, spiral galaxies, star-forming galaxies, 
chance alignments of features arranged in a lensing configu¬ 
ration and stars. 

We selected a sample of 450 duds for the Stage 1 clas¬ 
sification in Space Warps and a sample of 500 impostors 
for the Stage 2 inspection. The sample of impostors was 
selected from the candidates which passed the Stage 1 of 
Space Warps. We note that this is the first time we have 
a systematically compiled sample of visually inspected im¬ 
postors by the Space Warps volunteers and categorized by 
the science team. We produced an additional larger sam¬ 
ple of a few thousand false positive detections by scan¬ 
ning through the low probability images after the comple¬ 
tion of Stage 2. All of these data products will be made 
available at http://spacewarps. 0 rg/#/projects/CFHTLS/. 
Such a sample has tremendous utility for training and test¬ 
ing of various lens finding algorithms (e.g., Chan et al. 2014). 


4 METHODOLOGY TO PRODUCE THE 

Space Warps-CFHTLS LENS SAMPLE 

Space Warps works as a single unified system which uses the 
method of visual inspection to find gravitational lenses. For 
the first Space Warps lens search, the volunteers were shown 
images at two stages. At Stage 1, volunteers were asked to 
carry out a rapid inspection to select lens candiates ranging 
from possible lenses to almost certain lenses. At Stage 2, vol¬ 
unteers were asked to inspect the candidates from Stage 1 
and select only promising lens candidates. A daily snapshot 
of the classifications performed by volunteers was provided 
to the science team every night. This daily batch was anal¬ 
ysed by the Space Warps Analysis Pipeline (SWAP). The 
philosophy and the details of SWAP are described in detail 
in Paper 1. Here, we briefly summarize how it works. 

Each subject (image cutout) is assigned a prior proba¬ 
bility of 2 X 10“^ of containing a lens system. Every volunteer 
is assigned an agent characterised by a 2 x 2 confusion ma¬ 
trix A4, which quantifies the volunteer’s ability to correctly 
classify an image as containing a lens {Mll = Pl) or not 
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p 


Figure 4. Distribution of different types of candidates as a func¬ 
tion of the posterior probability P, obtained at the end of Stage 2. 
The types of the candidates are the false positives (FPs), the 
new candidates and the known candidates. The new and known 
candidates have higher detection rate for higher values of P, as 
expected. 


containing a lens {Mnn = Pd)- The values of these con¬ 
fusion matrix elements are determined based on the perfor¬ 
mance of the volunteer on the training sample, specifically, 
Pl (and Pd) is determined based on the fraction of simu¬ 
lated lenses (and, respectively, duds) correctly classified. Af¬ 
ter every classification, the agent updates the probability of 
the classified subject based on the volunteer’s classification 
and the confusion matrix, according to Bayes’ theorem. The 
agent’s confusion matrix is updated after the classification 
of every training image. The thresholds for the probabilities 
to accept or reject a subject if it contains a lens or does 
not contain one can be chosen in SWAP. In Stage 1, those 
images which cross these threshold values are “retired”, and 
are not subsequently shown to the volunteers. In this way, 
the crowd can use its time efficiently in inspecting previously 
unclassified subjects. 

SWAP was run nightly during Stage 1 in order to re¬ 
tire subjects and inject new ones in to the classification 
stream. The subjects that passed the detection threshold at 
the end of Stage 1 were served again at Stage 2 for careful re¬ 
inspection. The goal of Stage 1 inspection was to maximise 
completeness and that of Stage 2 was to maximise purity. 
Each subject, after Stage 2, has a final posterior probabil¬ 
ity P. In the ideal case, all images containing lenses will 
have high P values and those without lenses will have low 
P values. In practice, we expect a small fraction of the real 
lenses (or non-lenses) to be assigned low (or high) P values, 
thereby decreasing the completeness (or purity) of the final 
lens sample. As this is the first lens search with Space Warps, 
we want to find a threshold P value which will result in ac¬ 
ceptable levels of completeness and purity of the final sample 
of lens candidates. 


Table 2. Statistics of detections in Space Warps 



Stage 1 

Stage 2 


KG 

KL 

KG 

KL 

Number 

142 

39 

79 

34 

Fraction 

of total recovered 

32% 

66% 

18% 

58% 

-Faccthresh 

0.95 

0.95 

0.3 

0.3 

Averaged Grade (G) 

- 

- 

1.3 

1.3 


Stage 2 




NG 

AG 



Number 

59 

141 



Fraction 

of detections 

14% 

33% 



-Faccthresh 

0.3 

0.3 



Averaged Grade (G) 

1.3 

1.3 




KC- Known lens candidates 

KL- Known lenses 

NC- New lens candidates 

AC- All (known and new) lens candidates 

-Faccthresh “ Systems with Posterior probability P above this 
threshold are selected 

Note: For KC and KL, Percentages are with respect to the known 
population (i.e. 438 KC and 59 KL see Section 2.2) whereas for 
NC and AC, percentages are with respect to the total sample of 
429 lens candidates. 


To achieve this, we selected a total of 665 subjects with 
P > 0.3 at Stage 2 which were then visually inspected by 
three of us (as “lens experts”, AM, AV, PJM). Each images 
was assigned a grade on a scale of 0 to 3, representing those 
images 0) unlikely to contain a lens, 1) possibly containing 
a lens, 2) probably containing a lens and 3) almost certainly 
containing a lens. The final sample of Space Warps-CEHTLS 
lens candidates was then produced by selecting candidates 
above a threshold on the averaged grade G, as described in 
the next section. 

5 RESULTS 

5.1 Space Warps-CFHTLS lens sample 

In this section, we describe the Space Warps candidate lens 
sample from the CFHTLS. We find a total of 141 can¬ 
didates with G ^ 1.3 (medium-high grade), of which 59 
are new systems. This sample is further divided as fol¬ 
lows. We have a total of 50 candidates with 1.3 ^G< 2 
(medium grade), of which 30 are new. The quality of can¬ 
didates in this category is such that at least one of the in¬ 
spectors (“lens expert”) thought the candidate was prob¬ 
ably a lens (that is, a grade of 2) and a second inspector 
thought that it was possibly a lens (that is, a grade of 1). 
Among our high grade sample {G ^2), there are a total 
of 91 candidates, of which 29 are new. In this category, 
the minimum grade by all of the inspectors was 2, suggest¬ 
ing that the candidates are probably or almost certainly 
lenses according to all three inspectors. To avoid duplica¬ 
tion, only the newly discovered lens candidates with 
1.3 (medium-high probability) are presented in this paper 
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(see Section 5.2), and further information on Space Warps- 
selected candidates that were previously identified in the lit¬ 
erature (as described in Section 2.2) will be made available 
at http://spacewarps.org/#/projects/CFHTLS/. 

We also find a total of 288 (and 245 new) candidates 
with averaged grade 0 <G< 1.3 (low grade), which means 
that at least one of the inspector thought the candidate was 
possibly a lens, and in the best cases, all three inspectors 
thought the candidate was possibly a lens (that is, a grade 
of 1)^^. Further information on the low probability sam¬ 
ple such as their positions and images will be available at 
http://spacewarps. 0 rg/#/projects/CFHTLS/. Note that if 
all of the inspectors gave a grade of 0 to a candidate, then 
it was discarded from the sample. 

In Table 2, we give overall statistics of the systems de¬ 
tected at Stage 1 and Stage 2. We give the total number of 
detections of the known lens candidates, known lenses and 
the new lens candidates at each stage. We also give the re¬ 
covery fractions for the known samples and fraction of total 
detection for the new samples at each stage. Overall, the 
sample of new Space Warps lens candidates comprises over 
40 per cent of the total Space Warps —CFHTLS lens can¬ 
didates. We find that 90 per cent of the confirmed lenses 
found at Stage 1 are also recovered at Stage 2. However, 
~ 35 per cent of the known lenses were missed already at 
Stage 1: we return to the discussion of these false negatives 
below in Section 6.3. Nearly 45 per cent of the known lens 
candidates from Stage 1 are rejected at Stage 2. Such frac¬ 
tions are acceptable for “candidates” as their quality grades 
vary from high to low. 

In Figure 4, we plot the distribution of false positives 
and the high grade lens candidates, as a function of the P 
value assigned by SWAP at the end of Stage 2. On aver¬ 
age, the fraction of lens candidates is indeed an increasing 
function of P. This shows that the Space Warps generated P 
values for the subject are roughly correlated with the expert 
grades albeit with quite some scatter. We note that below 
P ~ 0.75, the fraction of false positives starts to exceed 
the fraction of real lens candidates. This could be a good 
threshold to choose to maximize the purity of the final sam¬ 
ple. However, choosing P = 0.3 gives a completeness of 92% 
for the “known lens” sample instead of 64% for P = 0.75. 
Therefore, the new sample should also have increased com¬ 
pleteness; expert grading then allows us to increase the pu¬ 
rity of the sample. 

5.2 New lens candidates from Space Warps 

We give basic information about the final sample of 59 new 
medium-high grade lens candidates found by Space Warps 
in Table 3. We report the candidates with a Space Warps 
ID and Name of the lens system. We give their positions 
(RA, Dec), photometric redshift (^phot), Tband magnitude 
of the lensing galaxy, averaged grade G from the lens ex¬ 
perts, zoo ID (identifier used in TALK), P value at Stage 2 
and a visual categorization of the type of lensed images and 
the lensing galaxy in the “Comments” column in Table 3. 

If grades from the inspectors were found to be discrepant by 
2 or more, these were discussed and re-graded to resolve the dis¬ 
crepancy. 


Whenever available the lens properties are taken from the 
CFHTLS photometric catalog (Coupon et al. 2009); other¬ 
wise, for the lens galaxy positions, the reported values were 
measured manually. The visual categorization of the lens 
type is only suggestive and the explanation of the notations 
in the Comments column is given at the bottom of the table. 

We show images of our new sample in Figure 5. The 
panels are arranged first in the descending order of their 
grades, and within each grade, in ascending order of RA. As 
the first lens search was a blind search with no pre-selection 
of candidates from any algorithm, we find various types of 
lenses, as expected from such a search. The final sample con¬ 
sists of both galaxy and group-scale lens candidates. There 
are detections of elongated arcs and some interesting point¬ 
like quasar lensed images. Most of them are brighter in the 
bluer g band, but some candidates brighter in the redder i 
band are also found. Since the robotic lens searches focused 
on the blue lensed features, they are likely to miss such in¬ 
teresting lens candidates. We did not find any examples of 
exotic lens candidates from the visually inspected P > 0.3 
sample. There may be some more interesting candidates that 
were missed either at Stage I or Stage 2 but have been iden¬ 
tified by volunteers in Talk. This resource is yet to be mined 
and is left for future work. 

The new Space Warps lens sample presented here illus¬ 
trates some of the advantages of having citizen scientists 
find lenses through visual inspection. An algorithm, by def¬ 
inition will find objects that adhere to a selection criteria 
that uses either geometry or flux information from an im¬ 
age. On the other hand, citizen scientists can interpolate 
over or extrapolate beyond the basic selection criteria pro¬ 
vided to them. For example, the lower blue arc in SW7 is 
split by a small red galaxy. An algorithm typically fails to 
detect such arcs because the arc is broken into smaller ar- 
clets which then falls below the minimum length or area 
allowed for an arc to be detected. Human inspectors have 
no problem in interpolating over the broken blue arc over 
the red galaxy, understanding that it is a single long arc. 
The system SW20 has point-like lensed images which can¬ 
not be detected by arc finding algorithm, whereas the ring 
finding algorithm may have missed this because of the atyp¬ 
ical colour and structure of the lensing galaxy. Detection of 
red arcs, for example, as seen in the SW39 candidate, shows 
how the volunteers extrapolate on the colour parameter: the 
training sample contains predominantly blue arcs, because 
the source colours were drawn from realistic observed distri¬ 
butions. 

The power of citizen scientists also lies in the high dy¬ 
namic range that allows us to find systems which have very 
short (thick) to long (thin) arcs, from highly compact to 
low surface brightness images, from round and point-like to 
elongated and curved images, from blue to red, from regular 
to exotic kinds of lenses; while keeping the false positive rate 
low compared to algorithms. Discovery of this large sample 
of completely new candidates missed by some of these algo¬ 
rithms demonstrates that the Space Warps system is func¬ 
tioning well, the self-taught citizen scientists reaching parts 
of discovery space that the robots did not. 

Further detailed qualitative and quantitative analysis 
of the properties of the entire Space Warps sample (new 
and previously identified candidates) and the mass mod- 
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Figure 5. The new Space Warps lens candidates with expert grade G>=1.3. The images are 30"on the side. 


elling analyses for the new candidates will be presented in a 
subsequent Space Warps paper (Verma et ah, in prep.). 


5.3 Measurements of properties of the lens and 
the lensed images 

In the subsequent sections, we compare various properties 
of the lens candidates. Here, we describe how we extract 
or measure these properties, namely, the lens redshift, the 
Einstein radii and the total flux of the lensed images or arcs. 

We use the publicly available redshifts for the lens 
galaxy from the CFHTLS photometric catalogs (Coupon 
et al. 2009). The total flux of the lensed image or arc is 


measured in the ^-band but the adopted method is different 
for different samples. For the simulated sample, we multiply 
the magnification of the second brightest image with the 
source magnitude. For the RingFinder sample, the arcs 
are detected in the scaled difference image of g and z-bands 
from which the lensing galaxy is subtracted (for details, see 
Gavazzi et al. 2014). Here, we use the flux of the lensed im¬ 
ages measured by S Extractor from the scaled difference 
image, that is, g — ai and convert it to the p-band flux using 
mean colours of the foreground and background population. 
For the ArcFinder and the Space Warps sample, we inte¬ 
grate the flux in the image pixels identified by ArcFinder 
or SExtractor. 
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Figure 6. Fraction of lens candidates recovered by Space Warps as a function of the arc magnitude {g band) and the Einstein radius 
for three lens samples, namely, the known lenses, the known lens candidates and the simulated sample. 


The Einstein radius is also measured differently for dif¬ 
ferent samples. For the galaxy-scale lenses in the simulated 
sample, we use the value of the input parameter of the lens 
model for the R^. For group-scale lenses, since the lens 
model is multi-component, we need to determine the 
from the image positions. We use those pairs of lensed im¬ 
ages that have the smallest and the largest angular sepa¬ 
rations. The here is then half of the averaged values of 
these angular image separations. For the RingFinder sam¬ 
ple, we use the peak position of the lensed images measured 
by running S EXTRACTOR on the scaled difference image. 
We calculate the image separation from the lens centre as 
a rough estimate of the Re- For the ArcFinder (SARCS) 
sample, we use the same definition as above except that the 
peak position is identified either by the ArcFinder or man¬ 
ually. For the Space Warps lens sample, the same definition 
is used where the peak positions are identified either with 
ArcFinder or SExtractor. 


5.4 Recovery of known lens samples from the 
CFHTLS by Space Warps 

We now determine the fraction of the known sample of lenses 
that were recovered by Space Warps. Note that this sample 
corresponds to the RingFinder and ArcFinder samples 
combined, as defined in Section 2.2. In Table 2, we show 
that ~ 32% of the known lens candidates, and ~ 65% of the 
known lenses were found at Stage 1. We find that 56% of 
the known lens candidates and 87% of the known lenses from 
Stage 1 passed our Stage 2 selection criterion of P > 0.3 and 
C > 1.3. The left and the middle panels of Figure 6 show 
the fraction of detections as a function of arc magnitude 
and the Einstein radius of the lens systems for the known 
confirmed lenses and lens candidates. As expected, we find 
that systems with brighter images and/or with larger Pe 
are detected more often in Space Warps. 

We find that most of the confirmed lenses and can¬ 
didates that are missed by Space Warps are systems from 
the RingFinder sample, with fainter arcs and smaller Pe- 
The main reason why RingFinder found such candidates 
is because it involves subtracting the light from the lens 
galaxy, making it easier to detect the lensed images during 


both the automated object-finding, and the visual inspec¬ 
tion and classification phases. This approach naturally im¬ 
proves the detection efficiency at smaller Pe and for fainter 
systems. The Space Warps volunteers were not shown any 
galaxy—subtracted images. Showing galaxy—subtracted im¬ 
ages might be a better strategy to adopt for future lens 
searches at galaxy-scales with Space Warps. However, we 
note that accurate modelling and subtraction of the galaxy 
light profile in different bands is challenging and better tech¬ 
niques are being actively developed to enhance detections of 
lenses at small image separations. In Section 6.3 below, we 
further explore and discuss why some of the confirmed lenses 
were missed by Space Warps. 


5.5 Image separation distribution 

The distribution of image separations (i.e. 2 Pe) can be 
used to probe the average density profile of the lens pop¬ 
ulation (Oguri 2006; More et al. 2012). However, the lens 
sample found by the ArcFinder may have incompleteness 
as a function of the image separation. Thus, the lack of un¬ 
derstanding of the selection function of the lens sample may 
affect the constraints on the density profile. A blind lens 
search done by visual inspection alone, for example, through 
Space Warps citizen scientists may find lenses missed by the 
ArcFinder search and thereby, improve completeness. 

Indeed, we have found 59 new medium-high grade lens 
candidates that were not known before. In Figure 7, we 
show the image separation distribution using all the known 
and new lens candidates. The different data points are the 
known RingFinder and ArcFinder sample (green), the 
Space Warps identified (known and new) lens sample (blue) 
only and the combined CFHTLS sample of RingFinder, 
ArcFinder and the new Space Warps lens sample (ma¬ 
genta). It is interesting to note that both the RingFinder 
+ArcFinder and Space Warps samples have very similar 
profiles and thus, the profile of the combined sample has 
not changed significantly. This implies that previous con¬ 
straints on the image separation distribution are robust and 
the ArcFinder selected sample does not suffer from signif¬ 
icant incompleteness for medium to large Re- This is the 
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regime that probes density profiles of galaxy groups to clus¬ 
ters. 

In the figure, we also show for comparison the theo¬ 
retical predictions corresponding to three density profiles, 
namely, isothermal sphere (IS), NFW and a “Total” profile 
which has NFW and Hernquist profiles combined with an 
adiabatically contracting model for the dark matter com¬ 
ponent (Gnedin et al. 2004). These curves are taken from 
More et al. (2012), which gives details of the calculation of 
these predictions. With the updated sample of lens candi¬ 
dates, we confirm our previous prediction that the mass den¬ 
sity profiles of galaxy groups is indeed consistent with the 
“Total” profile. At smaller image separations (< 2 arcsec), 
the “Total” profile converges to isothermal like case and as¬ 
suming these predictions are reliable, we find that the lens 
samples have very low completeness. This is not too sur¬ 
prising compared to the 40% completeness expected for the 
RingFinder sample (Gavazzi et al. 2014). 


6 DISCUSSION 

Finding gravitational lenses is a difficult and complex task. 
No single method is perfect, each method has some advan¬ 
tages over the other. It may be the case that a single method 
may never be the best method for optimising completeness 
and purity. Visual inspection will likely be required for prun¬ 
ing candidates at some stage of lens candidate selection even 
in the future. Therefore, we would like to understand how 
best we should combine the strengths of robots and humans 
to optimize the lens finding method. 

In this section, we first compare the lens candidates 
found by Space Warps and the lens finding robots and then 
attempt to understand why each method failed to detect 
lenses from the other sample. 


6.1 Comparison of the RingFinder, Space Warps 
and ArgFinder samples 

In Figure 8, we show the lens redshift and the arc flux mea¬ 
sured in ^-band AB magnitude as a function of the Ein¬ 
stein radius for the RingFinder (green), the ArgFinder 
(red) and the Space Warps sample (new candidates only in 
blue and known candidates as blue circles). We note that 
the errors on the redshift measurement should not be too 
different across the samples since they are measured by a 
single method. However, the error on the total flux of the 
lensed images are likely to be different across the samples 
and the types of systematics are also different. We have not 
attempted to quantify these errors in this work. With that 
caveat, we find that the Space Warps candidates sample is 
broadly similar to the robotically found lens candidates in 
terms of the flux of the lensed images and the redshift of the 
lensing galaxies. 

The properties considered here do not show any clear 
differences between the types of lenses being found by each 
method. Other properties such as the flux of the lensing 
galaxies and the surface brightness of the lensed images may 
be useful in showing some qualitative differences but this is 
beyond the scope of our current analysis. A more detailed 
and accurate analysis is deferred to the future. 



Figure 7. Image separation distribution (ISD). Comparing the¬ 
oretical predictions (solid curves) with the CFHTLS known 
lens samples (green) and the combined sample of known and 
Space Warps lens candidates (magenta). The sample of new and 
the known lens candidates discovered from Space Warps alone is 
shown in blue. The new updated profile of the ISD (magenta) is 
consistent with our previous measurements and strengthens our 
conclusion that the average density profiles of the lenses are sim¬ 
ilar to the Total profile. 

In Figure 9, we show the relative distribution of num¬ 
ber of candidates from each sample as a function of the Ein¬ 
stein radius and arc magnitude. The light blue colour shows 
the overlap between the Space Warps and the RingEinder 
samples and the purple colour shows the overlap between 
the Space Warps candidates and the ArgEinder samples. 
As noted earlier, the RingEinder dominates the small 
Re(< 2") detections although Space Warps does find a 
modest number of candidates in this range. At larger Re, 
Space Warps sample begins to dominate and is compara¬ 
ble to the ArgEinder sample. As a function of the arc 
magnitudes, all three samples have detections at all mag¬ 
nitudes and median magnitudes for all samples is around 
g ~ 24.5. Relatively, the RingEinder sample spans a nar¬ 
rower range compared to the Space Warps and ArgEinder 
sample. However, this can be verified only after understand¬ 
ing and accounting for the systematic uncertainties in our 
measurements. 

6.2 Why were the new Space Warps candidates 
missed by the robots? 

We test the RingEinder and ArgEinder on images centred 
on the new Space Warps candidates to trace and understand 
at what stage the algorithm failed to detect them. 

Eirst, we re-ran RingEinder on the new Space Warps 
sample. At the beginning, a galaxy catalog is generated 
based on magnitude, redshift and SED type (Gavazzi et al. 
2014, see) to select galaxies which are most likely to act 
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Figure 8. Comparison of the lens redshift and the arc mag¬ 
nitude with the Einstein radius for all of the three lens sam¬ 
ples, namely, the RingFinder (green dots). Space Warps (known 
candidates—blue circles and new candidates only—blue dots) and 
ArcFinder (red dots). All samples have broadly similar proper¬ 
ties. 


as lenses. We find that about 40 per cent of the new 
Space Warps candidates failed to meet this initial selection 
criteria, for example, SWl, SW14, SW20, SW23, SW27 and 
SW30. All of the lensing galaxies are bright enough to sat¬ 
isfy the magnitude criterion (i < 22). However, some of them 
have a bright companion galaxy, some of them do not look 
like E/SO type galaxies and some are edge on galaxies which 
could be the reason for these galaxies having failed the pho¬ 
tometric redshift and SED type pre-selection. 

In subsequent RingFinder steps, the flux from the 
galaxy is subtracted from the scaled difference image to en¬ 
hance the visibility of the faint blue lensed features. An ob¬ 
ject finder is then run on this image to quantify the lensed 
image properties. 

Another ~ 50 per cent of the Space Warps candidates 
could not be detected by the object finder because prop¬ 
erties such as the image area, axis ratio, magnitude/colour 
and alignment with respect to the lensing galaxy were not 
satisfied. Some of the candidates missed at this stage are, for 
example, SW4, SW5, SW6, SW26, SW36, SW39 and SW46. 

Next, we re-ran the ArcFinder on the same 
Space Warps sample of new candidates. The ArcFinder is 
directly run on the images to look for elongated arc like ob¬ 
jects and does not require a list of targets to begin with. Ob¬ 
jects are identified by placing thresholds on the noise level 
in the images. Thus, ArcFinder detections are sensitive to 
changes in the noise levels. 

Originally, the ArcFinder was run on a large image 
with an area of ~ 19350 x 19350 pixels^. For the rerun, 
we worked with much smaller images because this is faster. 


However, this alters the measured noise and hence affects 
the number and type of arc detections. We find that about 
30 per cent of the new candidates were detected without 
changing any of the thresholds in the code, suggesting that 
these could have been detected by ArcFinder had its noise 
thresholds been set differently. 

The ArcFinder code calculates second order bright¬ 
ness moments around every pixel to decide if the distribution 
of flux is elongated in some direction in order to detect elon¬ 
gated arc-like objects. An elongation estimator is assigned 
to every pixel. All pixels with a value of the elongation esti¬ 
mator above a certain threshold are connected to form the 
arc feature. This is called the segmentation of the arc can¬ 
didate. Subsequently, arc properties such as the area, mean 
flux, length and curvature are determined. We relaxed the 
threshold at the segmentation stage and also relaxed thresh¬ 
olds mainly on the area of the arc. These new settings led to 
the detection of about 75 per cent of the new Space Warps 
candidates. We find that relaxing thresholds on other arc 
properties does not improve the detection rate significantly. 

Typically then, the Space Warps candidates were missed 
from the ArcFinder sample either because a) the arcs were 
fainter, b) the flux of the arc and the galaxy were blended 
together (such that the ArcFinder incorrectly connected 
part of the galaxy to the arc), c) the arcs were unusually 
short or thick, or d) the lensed images are almost circular 
or point-like (ArcFinder was not designed to detect lensed 
quasars). 

Relaxing the ArcFinder thresholds obviously increases 
the number of candidate arc detections but this also in¬ 
creases the false positive rate. For example, the number of 
arc candidate detections increased by a factor of ~2 when we 
relaxed the thresholds in the rerun described above, while 
the number of false positives increased by a factor of ~5. 
While the ArcFinder sample purity could be increased by 
cross-correlating the arc candidate positions with a putative 
lens galaxy catalog, these numbers illustrate the predica¬ 
ment facing automated lens-finding algorithms, and the con¬ 
tinuing benefits of visual screening. 


6.3 False negatives: known lenses missed by 

Space Warps 

Like any lens finding method, the Space Warps system could 
potentially be failing to detect certain kinds of lenses. We 
find that about 35 per cent of the known sample of lenses 
are missed at Stage 1; about 10 per cent losses were in¬ 
curred during the Stage 2 refinement (see Table 2). Below, 
we focus on the known lens sample at Stage 1 to understand 
why some of them are being missed and possibly find a way 
to improve the detection rate which can be adopted in the 
future Space Warps lens searches. 

Many of the missed lenses are from the RingFinder 
sample with small Einstein radii and faint lensed images (see 
Figure 6). Among the confirmed lenses from RingFinder, 
about 45 per cent are missed. Out of the missed sample of 
15 lenses, about half of them are visually difficult to detect. 
The other half appear to have faint blue smudges around 
galaxies which should have been easier to identify. Similarly, 
if we consider the ArcFinder lens sample, ~20 per cent are 
missed by Space Warps. This is a relatively small sample of 
~ 5 systems and visual inspection of them suggests that. 
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Figure 9. Candidate detections by the RingFinder, Space Warps and the ArcFinder as a function of the Einstein radius and g band 
magnitude of the lensed images. 


by and large, either the lensed features are faint or they 
have odd properties which makes them difficult to identify 
correctly. For further tests, we combine the RingFinder 
and ArcFinder sample. 

For a lens finding method which uses the collective skill, 
experience and knowledge of a group of volunteers, it may be 
difficult to find a single factor with certainty which causes 
a lens candidate to be missed. We attempt to understand 
whether there is indeed a single dominant factor that is re¬ 
sulting in the loss of these lenses, or if the lenses are being 
missed due to a combination of multiple reasons. Below, we 
consider some of the factors that could affect the efficiency 
of finding lenses. 

6.3.1 Number of classifications 

First, we check whether the number of classifications 
(Nclass) is significantly lower for the missed sample com¬ 
pared to the detected one. Most of the lenses in the known 
sample (including both those that were detected and those 
that were missed) received similar numbers of classifications 
to the other subjects. A few received Nclass>20, possibly 
as a result of continuing to remain for a long time in the 
database because there was uncertainty over whether or not 
to reject them. Overall, we do not find any difference in 
the number of classifications between the detected and the 
missed lenses. 

6.3.2 Lens positions within the image cutouts 

The efficiency of a visual search could potentially vary in 
different sections of an image. Our eyes tend to focus usually 
at the centre of an image and lens candidates close to the 
borders could go undetected. Therefore, it is important to 
check whether Space Warps could be missing some of the 
known lenses because they happen to be close to the borders 
of the image cutouts. 

From the SWAP, the image cutouts inspected by the 
Space Warps volunteers receive a status of detected (if P > 


Raccthresh), rcjccted (if P < Prejthresh) and undccidcd (if 
Prejthresh < P < Paccthresh). In Figure 10, wc comparc 
the positions of lenses which are detected (red), undecided 
(green) and rejected (blue). The left and the right panels 
have the simulated lens sample and the known lens can¬ 
didates sample, respectively. We note that the density of 
points do not represent the actual number of detections be¬ 
cause, for cases with large sample size, randomly drawn sub- 
samples are shown for the ease of visual comparison. 

We do not find any strong correlation in the detection 
rate of lenses as a function of their positions in the image, 
for either the simulated or the known lens sample. Thus, 
the completeness of the lens sample is most likely not signif¬ 
icantly affected by lenses located close to the image borders. 

6.3.3 Classification Power 

Each Space Warps image classification is based on the mark¬ 
ers placed by around 10 volunteers (on average. Paper I). 
This number could be small enough to introduce some scat¬ 
ter in the system performance, arising from the variations 
between the small groups of volunteers inspecting each sub¬ 
ject. In Paper I we investigated the system performance in 
terms of the “Skill” of each agent; in Appendix A we define 
a complementary property, the “Power” of a classification to 
make a large difference in the probability P of an image con¬ 
taining a lens. Here, we investigate whether the distribution 
of classification power is systematically different between the 
detected and missed lenses. 

We check how the posterior probability P (see Paper I 
for the mathematical definition) of an image or a subject 
to contain a lens changes as the image receives more clas¬ 
sifications from multiple volunteers. A graphical represen¬ 
tation of changing probabilities for increasing classifications 
is called a trajectory plot. In Figure 11, we show the tra¬ 
jectory plots of a few examples of detected lenses (top row 
of panels) and missed lenses (bottom row of panels) from 
Stage 1 of Space Warps. Every subject is assigned a prior 
probability Pq = 2 x 10“^ (grey dashed line) and starts at 
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the middle of the trajectory plot. The number of classifica¬ 
tions (Nclass) for a subject increases from top to bottom 
(subjects move down the trajectory plots as they are classi¬ 
fied). The P value of a subject is updated with every clas¬ 
sification from the volunteer. If a volunteer identifies a lens 
candidate, the trajectory moves to the right otherwise moves 
to the left. A subject is accepted if it crosses the blue-dashed 
line marking the Paccthresh (set to 0.95 for Stage 1) on the 
right. It is rejected if it crosses the red-dashed line marking 
the -Prejthresh (set to 10 ^ for Stage 1) on the left. 

The amount by which the posterior probability P value 
of a subject will change depends on how well the volunteers 
are performing on the training sample and its current proba¬ 
bility. Thus, for a given current probability, some volunteers 
will change the P by a large factor compared to others. This 
is evident in the trajectory plots, which show both large and 
small distances between consecutive points which we refer 
to as kicks. Comparison of the kick sizes between the de¬ 
tected and the missed lenses suggests that the missed lenses 
do not have as many volunteers giving large kieks. We also 
note that most of the large kicks seen in the trajectories 
of the missed lenses seem to be moving the subjects to the 
right. In other words, certain “high power” volunteers are 
mostly classifying them as subjects with lens candidates. 

The bottom panels of Figure 11, show the trajectories 
of missed lenses for the cases which are visually easier (light 
green) and more difficult (dark green) to identify. In spite 
of some mild qualitative differences, both set of trajectories 
have very similar behaviour. The trajectories in panel (e) are 
typical of this sample in terms of Nclass and the dominance 
of small negative kicks. Panel (f) represents a small fraction 
of this sample where the kicks are only small and negative. 
The panel (g) shows how some lenses receive a bunch of 
large positive kicks which are led to rejection by still mostly 
small negative kicks. Finally, panel (h) shows those cases 
of lenses which received almost sufficient number of large 
positive kicks to be detected but ended up being rejected. 

The detected lenses shown with green trajectories, in 
the upper panels of Figure 11, can be thought of as coun¬ 
terparts of the trajectories of the missed lenses in the cor¬ 
responding bottom panels except that their classification 
power is different. Most detected lenses are similar to the 
case in panel (a) that are detected within a few classifica¬ 
tions coming from large positive kicks. Panel (b) represents a 
few odd cases which are dominated mainly by small positive 
kicks. Panel (c) shows a lens getting more classifications, but 
not reaching the detection or rejection thresholds because of 
the tug between positive and negative kicks mostly from ex¬ 
perienced volunteers. Panel (d) represents two extreme cases 
when the images are on the verge of being rejected but are 
saved thanks to a series of large positive kicks. The red tra¬ 
jectories are some more examples of randomly selected cases 
which demonstrate how having sufficient number of large 
positive kicks allows lenses to be detected in spite of several 
small negative kicks. 

For a quantitative comparison of the large and small 
kicks for the entire samples of detected and missed known 
lenses, we show a plot of histogram on the right of Figure 11. 
Qualitatively, there are four types of volunteers making clas¬ 
sifications: those causing large, positive kicks (correct clas¬ 
sifications by high power volunteers), those causing small, 
positive kicks (correct classifications by low power volun¬ 


teers), those causing small, negative kicks (incorrect classi¬ 
fications by low power volunteers) and those causing large, 
negative kicks (incorrect classifications by high power volun¬ 
teers) . The four histograms in the figure correspond to these 
four types of volunteers for each sample (that is, detected 
or missed). In this plot, the kick size is defined as small if 

Alog-P(= log -Pcurrent log -^previous) A log -Pcut (choSCU 

as 1.2) and is large if greater than AlogPcut- 

Some of the key inferences are as follows, i) The ratio 
of positive kicks to negative kicks for the detected sample is 
higher than in the missed sample, suggesting that the frac¬ 
tion of volunteers making positive kicks is higher for the de¬ 
tected sample, ii) The number of classifications received by 
the missed sample is dominated by small negative kicks. In 
contrast, for the detected sample there are comparable con¬ 
tributions from all three types - small positive kicks, large 
positive kicks, and small negative kicks, iii) The number 
of classifications providing large negative kicks is lower for 
both the detected and the missed samples. This is consistent 
with our expectation that high power volunteers should not 
be making incorrect classifications. We conclude that one of 
the major factors in the Space Warps system missing the 
known lenses is a lack of high power classifications. 

As a demonstration, we re-ran SWAP for Stage 1 
using only classifications that produced large kicks, with 
IA log PI > 1.2. This obviously meant reducing the total 
number of classifications per subject by a large fraction. As 
a result, we also needed to change the Paccthresh- Choosing 
this to be 0.1, we found that about a third of the lenses 
that had previously been missed were now detected, while 
all the previously detected lenses were again detected. The 
remaining missed lenses simply do not have enough clas¬ 
sifications from volunteers producing large positive kicks. 
This experiment shows that it may be possible to increase 
the Space Warps completeness by preferentially showing cer¬ 
tain rejected systems - those that had never received a high 
power classification - to volunteers capable of making such 
classifications. 

Changing the rejection and acceptance thresholds will 
likely decrease the purity along with improved complete¬ 
ness. This will need to be further quantified before detailed 
recommendations can be made. However, dynamically as¬ 
signing certain subjects to volunteers according to various 
measures of their skill seems like a fruitful line of investi¬ 
gation when seeking to improve the system performance in 
future. 


7 SUMMARY AND CONCLUSIONS 

We report the discovery of gravitational lens candidates 
from the first Space Warps lens search. In this search, volun¬ 
teers were shown g — r — i colour images of random regions of 
the sky taken by the CFHT Legacy Survey. The aim of this 
blind lens search was to find lenses that had been missed 
by previous searches done on the CFHTLS with lens finding 
algorithms. 

The search was carried out in two stages. In Stage 1, vol¬ 
unteers inspected ~ 430,000 images, and selected a smaller 
sample of ~ 3000 images as having interesting lens candi¬ 
dates. In Stage 2, after a careful second inspection of the 
candidates from Stage 1, a purer sample of ~ 500 candidates 
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Figure 10. Completeness as a function of the positions of the lens systems. Simulated lenses (left) and real lens candidates (right) are 
shown. Irrespective of the status of the lenses, that is, detected, undecided or rejected, there is no strong dependency on the location of 
the lenses, both for the simulated and the real sample of candidates. 
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Figure 11. Left: Examples of Stage 1 trajectories of some known lenses. Upper panels show detected lenses, lower panels show missed 
lenses (false negatives). The green trajectories for the detected sample are counterparts of the light and dark green trajectories of the 
missed sample except that the kicks are positive for the detected sample. The red trajectories of the detected sample demonstrate 
that sufficient number of positive large kicks can lead to the detection of the lenses. Within the missed sample, the light and dark 
green correspond to visually easier and more difficult to identify systems, respectively. Right: Histogram of classification power received 
by known lenses. Among the sample of missed lenses, most classifications are from the low power volunteers identifying these images 
incorrectly (i.e. —A log(P)) which overshadows the small number of correct identifications (i.e. A log(P)) coming from both the less and 
the high power volunteers combined. For the detected lens sample, most classifications are correct identifications coming from both low 
and high power volunteers. 


was obtained. In a final step, these images were inspected 
by three of us (AM, AV and PJM) to produce a sample 
of candidates with grades ranging from possibly a lens (1) 
to almost certainly a lens (3). In this paper, we presented 
this new Space Warps sample and compared it with the pre¬ 
viously known samples from two robotic searches from the 
CFHTLS, namely, RingFinder and ArcFinder. 


Our conclusions are as follows: 

• Space Warps works well as a discovery engine for gravi¬ 
tational lenses through citizen science. While a targeted vi¬ 
sual search may be more efficient, we show that the blind 
search works reasonably well too. 

• We use a sample of simulated lenses, duds and impos¬ 
tors tailored to the CFHTLS data to train the volunteers 
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and calibrate their performance. The volunteers not only 
perform well on the training sample (see Paper I) but also 
hnd lenses that are fainter, more compact or redder than 
covered by the training sample demonstrating their adapt¬ 
ability in this task. 

• We present a sample of 29 new gravitational lens can¬ 
didates, and an additional 30 medium grade systems. These 
59 candidates received averaged grade G^l.3 from three ex¬ 
perts following the scale where (1) means possibly, (2) means 
probably, and (3) means almost certainly, a lens. In addition, 
among the G^l.3 sample, we re-discovered 82 lens candi¬ 
dates from various samples published in the literature. 

• Gompared to the sample of RingFinder and 
ArcFinder robotically detected lens candidates, the 
Space Warps sample finds lens systems with statistically 
similar properties, including the range of lens redshifts, 
lensed image total magnitudes, and Einstein radii. How¬ 
ever, having only displayed images without the lens galaxy 
light subtracted, the Space Warps sample does not contain 
many of the RiNGFiNDER-identified-lensed images with sub- 
arcsecond Re, just because the flux of the typically faint 
lensed images is obscured by the flux from bright lensing 
galaxies. 

• Qualitatively, Space Warps seems to have found lens 
systems with different types of lensing galaxies, for exam¬ 
ple, elliptical, spiral (face on and edge on) and small red 
galaxies unlike those found from robotic searches. Similarly, 
the lensed images too have diverse properties such as differ¬ 
ent colours, morphologies and sizes which are again typically 
missed by any given algorithm. 

• Based on the known sample of lenses and lens candi¬ 
dates, we hnd that we lose a small fraction of them during 
Stage 2 rehnement. It is more important to improve the lens 
detection sensitivity at the initial Stage 1 classihcation step. 
About 35% of the known lenses (20 in total) were missed at 
Stage 1. Two thirds of these missed lenses were found to be 
galaxy-scale RingFinder systems, with faint arcs blended 
with the bright lens galaxies. 

• It is possible to improve the Space Warps completeness 
by changing the strategy of when and who is shown an im¬ 
age: only using high power classihcations recovers 40% of 
the missed lenses. 

The discovery of many new lens candidates through the 
hrst Space Warps lens search has demonstrated that the citi¬ 
zen scientists have successfully taught themselves to identify 
lenses within a short span of time. They have found lens can¬ 
didates which the algorithms failed to discover. Upcoming 
and planned wide-field imaging surveys such as the DES, 
HSG, Euclid and LSST will produce formidable amounts 
of data. Blind lens searches as described here will be im¬ 
practical with these very large surveys. However, it should 
be possible to conduct a blind search on a sub-area of a 
large survey in order to assess the performance of either 
the algorithms or the volunteers, the results of which can 
be extrapolated to the entire survey. As demonstrated in 
this paper, any one approach for hnding lenses from the en¬ 
tire survey data may not be sufficiently complete and pure. 
Thus, combining robotic methods for pre-selection with the 
citizen science approach for visual screening might be a good 
strategy for finding lenses in these large imaging surveys. 
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Table 3: Sample of the Space Warps new lens candidates. 


SW ID 

Name 

RA 

(deg) 

Dec 

(deg) 

Zph.ot 

mi 

(mag) 

(”) 

G 

ZooID 

P 

Comments 

SW19 

CFHTLS J020642.0-095157 

31.67504 

-9.86584 

0.2 

20.8 

0.9 

2.0 

ASW00011d7 

0.8 

A,R 

SW8 

CFHTLS J020648.0-065639 

31.70031 

-6.94430 

0.8 

20.2 

1.3 

2.3 

ASW00099ed 

0.4 

A,E 

SW43 

CFHTLS J020810.7-040220 

32.04497 

-4.03891 

1.0 

20.8 

1.8 

1.3 

ASW0001c3j 

0.7 

A,R 

SW9 

CFHTLS J020832T-043315 

32.13396 

-4.55429 

1.0 

21.0 

1.6 

2.3 

ASW0002asp 

1.0 

A,R 

SWIO 

CFHTLS J020848.2-042427 

32.20110 

-4.40751 

0.8 

20.5 

1.1 

2.3 

ASW0002bmc 

0.9 

D,D 

SWll 

CFHTLS J020849.8-050429 

32.20784 

-5.07494 

0.8 

20.6 

0.9 

2.3 

ASW0002qtn 

1.0 

A,R 

SW44 

CFHTLS J02102L5-093415 

32.58981 

-9.57109 

0.4 

18.4 

2.7 

1.3 

ASW0002k40 

0.4 

D,S 

SW30 

CFHTLS J021057.9-084450 

32.74148 

-8.74745 

0.0 

0.0 

2.5 

1.7 

ASW0002p8y 

0.4 

A,G 

SW20 

CFHTLS J021221T-105251 

33.08810 

-10.88106 

0.3 

17.9 

1.8 

2.0 

ASW0002dx7 

0.8 

D,E/S 

SW45 

CFHTLS J021225.2-085211 

33.10511 

-8.86973 

0.8 

19.5 

2.1 

1.3 

ASW00024id 

1.0 

R,R 

SW46 

CFHTLS J021317.6-084819 

33.32341 

-8.80549 

0.5 

19.8 

1.3 

1.3 

ASW00024q6 

0.4 

A,R/E 

SW31 

CFHTLS J021514.6-092440 

33.81089 

-9.41115 

0.7 

19.9 

2.6 

1.7 

ASW00021r0 

0.4 

A,R/G 

SW32 

CFHTLS J022359.8-083651 

35.99955 

-8.61439 

0.0 

0.0 

3.1 

1.7 

ASW0004iye 

0.4 

A,E 

SW12 

CFHTLS J022406.1-062846 

36.02558 

-6.47963 

0.4 

19.6 

0.9 

2.3 

ASW0003wsu 

0.7 

A,E 

SWl 

CFHTLS J022409.5-105807 

36.03978 

-10.96885 

0.0 

0.0 

4.8 

3.0 

ASW0004dv8 

1.0 

A,G 

SW21 

CFHTLS J022533.3-053204 

36.38882 

-5.53460 

0.5 

19.4 

3.6 

2.0 

ASW0004m3x 

0.4 

A,R/G 

SW22 

CFHTLS J022716.4-105602 

36.81856 

-10.93410 

0.4 

17.3 

1.8 

2.0 

ASW0009ab8 

0.7 

A,E/G 

SW33 

CFHTLS J022745.2-062518 

36.93868 

-6.42183 

0.6 

20.5 

1.2 

1.7 

ASW0003s0m 

0.5 

A,R 

SW13 

CFHTLS J022805.6-051733 

37.02362 

-5.29266 

0.4 

18.8 

1.4 

2.3 

ASW00047ae 

1.0 

Q,E 

SW47 

CFHTLS J022843.0-063316 

37.17942 

-6.55465 

0.5 

19.1 

1.8 

1.3 

ASW0003r6c 

0.3 

D/A,E 

SW23 

CFHTLS J023008.6-054038 

37.53591 

-5.67744 

0.6 

19.7 

1.9 

2.0 

ASW0003r61 

0.5 

A,E 

SW14 

CFHTLS J023123.2-082535 

37.84682 

-8.42663 

0.0 

0.0 

1.2 

2.3 

ASW0004xjk 

0.3 

A,R 

SW24 

CFHTLS J023315.2-042243 

38.31334 

-4.37886 

0.7 

19.7 

1.0 

2.0 

ASW00050sk 

0.8 

A,R 

SW34 

CFHTLS J023453.5-093032 

38.72321 

-9.50892 

0.5 

19.8 

0.7 

1.7 

ASW000511d 

0.3 

A,D 

SW35 

CFHTLS J084833.2-044051 

132.13847 

-4.68085 

0.8 

20.2 

0.9 

1.7 

ASW0004wgd 

0.7 

A,R 

SW15 

CFHTLS J084841.0-045237 

132.17084 

-4.87720 

0.3 

19.0 

1.0 

2.3 

ASW0004nan 

1.0 

A,E 

SW48 

CFHTLS J090219.0-053923 

135.57947 

-5.65666 

0.0 

0.0 

2.0 

1.3 

ASW0000g95 

1.0 

A,R/E 

SW36 

CFHTLS J090248.4-010232 

135.70204 

-1.04243 

0.4 

19.1 

1.4 

1.7 

ASW000096t 

0.6 

D,E 

SW25 

CFHTLS J090308.2-043252 

135.78449 

-4.54789 

0.0 

0.0 

1.3 

2.0 

ASW00007mq 

0.6 

D,D 

SW49 

CFHTLS J090319.4-040146 

135.83105 

-4.02971 

0.0 

19.8 

1.2 

1.3 

ASW000071S 

0.5 

A,R/E 

SW50 

CFHTLS J090333.2-005829 

135.88869 

-0.97490 

0.0 

0.0 

2.1 

1.3 

ASW00008a0 

1.0 

A/D,E/G 

SW51 

CFHTLS J135724.8+561614 

209.35374 

56.27066 

0.0 

0.0 

2.6 

1.3 

ASW0006e0o 

0.9 

D,E 

SW26 

CFHTLS J135755.8+571722 

209.48268 

57.28971 

0.8 

20.2 

0.9 

2.0 

ASW0005ma2 

0.8 

A,R 

SW52 

CFHTLS J140027.9+541028 

210.11636 

54.17455 

0.0 

0.0 

1.2 

1.3 

ASW0006a07 

0.6 

Q,R/E 

SW16 

CFHTLS J140030.2+574437 

210.12601 

57.74371 

0.4 

18.2 

2.0 

2.3 

ASW0009bp2 

0.6 

A,E 

SW2 

CFHTLS J140522.2+574333 

211.34261 

57.72587 

0.7 

19.7 

1.0 

2.7 

ASW000619d 

0.7 

A,R 

SW17 

CFHTLS J140622.9+520942 

211.59581 

52.16169 

0.7 

20.3 

1.2 

2.3 

ASW0005rnb 

0.7 

A,R 

SW27 

CFHTLS J141432.9+534004 

213.63716 

53.66788 

0.7 

21.4 

1.0 

2.0 

ASW0006jh5 

0.8 

A,R 

SW53 

CFHTLS J141518.9+513915 

213.82903 

51.65420 

0.4 

18.3 

3.0 

1.3 

ASW00070vl 

0.8 

D,E 

SW3 

CFHTLS J142603.2+511421 

216.51375 

51.23935 

0.0 

0.0 

4.4 

2.7 

ASW0006mea 

0.7 

A,G 

SW54 

CFHTLS J142620.8+561356 

216.58699 

56.23230 

0.5 

19.5 

1.3 

1.3 

ASW0007sez 

0.8 

A/R,S 

SW55 

CFHTLS J142652.8+560001 

216.72004 

56.00044 

0.0 

0.0 

1.5 

1.3 

ASW0007t5y 

1.0 

R,R 

SW56 

CFHTLS J142843.5+543713 

217.18153 

54.62036 

0.4 

19.7 

1.3 

1.3 

ASW0007pga 

0.6 

D,D 

SW4 

CFHTLS J142934.2+562541 

217.39261 

56.42807 

0.5 

19.0 

5.9 

2.7 

ASW0009cjs 

0.8 

A,G 

SW28 

CFHTLS J143055.9+572431 

217.73332 

57.40883 

0.7 

19.3 

1.6 

2.0 

ASW0007xrs 

0.9 

A,R/G 

SW37 

CFHTLS J143100.2+564603 

217.75124 

56.76750 

0.0 

0.0 

1.2 

1.7 

ASW00086xq 

0.8 

A,E 

SW38 

CFHTLS J143353.6+542310 

218.47357 

54.38624 

0.8 

19.8 

1.8 

1.7 

ASW0009cp0 

0.7 

A,E 

SW5 

CFHTLS J143454.4+522850 

218.72702 

52.48080 

0.6 

19.4 

4.4 

2.7 

ASW0007k4r 

0.4 

Q,G/R 

SW6 

CFHTLS J143627.9+563832 

219.11636 

56.64249 

0.5 

19.5 

1.5 

2.7 

ASW0008swn 

0.9 

A,D 

SW57 

CFHTLS J143631.5+571131 

219.13155 

57.19215 

0.7 

20.9 

1.3 

1.3 

ASW0008pag 

0.6 

D/A,R 

SW58 

CFHTLS J14365L6+530705 

219.21503 

53.11832 

0.6 

19.2 

3.1 

1.3 

ASW0007iwp 

0.7 

A,E/G 

SW18 

CFHTLS J143658T+533807 

219.24246 

53.63550 

0.7 

19.6 

0.9 

2.3 

ASW0007hu2 

0.6 

D,D 

SW29 

CFHTLS J143838.1+572647 

219.65887 

57.44645 

0.8 

20.2 

1.1 

2.0 

ASW0008qsm 

0.9 

A,R 

SW59 

CFHTLS J143950.6+544606 

219.96101 

54.76858 

0.0 

0.0 

1.7 

1.3 

ASW00085cp 

0.4 

A,G/R 

SW39 

CFHTLS J220215.2+012124 

330.56348 

1.35667 

0.3 

17.4 

4.6 

1.7 

ASW0005qiz 

0.5 

rA,G 

SW7 

CFHTLS J220256.8+023432 

330.73691 

2.57581 

0.0 

0.0 

6.8 

2.7 

ASW0007e08 

0.8 

A,G 

SW40 

CFHTLS J221306.1+014708 

333.27579 

1.78561 

0.0 

17.1 

1.4 

1.7 

ASW0008wmr 

0.9 

A,S 

SW41 

CFHTLS J221519.7+005758 

333.83212 

0.96615 

0.4 

20.2 

1.0 

1.7 

ASW0008xbu 

0.8 

A,D 
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SW ID 

Name 

RA 

(deg) 

Dec 

(deg) 

'^(phot 

mi 

(mag) 

Re 

(”) 

G 

ZooID 

P 

Comments 

SW42 

CEHTLS J221716.5+015826 

334.31894 

1.97394 

0.1 

21.6 

1.0 

1.7 

ASW00096rm 

1.0 

A/R,R 


The column Comments has two type of notes. The first is about the lens image configuration where the symbols mean the 
following A: Arc, D: Double, Q: Quad, R: Ring. The second is a comment on the type of lens assessed visually. Note that this 
classification is not based on colours or spectral analysis. The symbols are E: Elliptical, S: (face on) Spiral, G: Group-scale, D: 
Edge on disk, R: Red star-forming galaxy. 
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APPENDIX A: LENS DETECTION POWER 

In Paper I, we defined the “Skill” of an agent as being given 
by the expectation value of the information gain per classi¬ 
fication. This quantity is a non-linear function of both the 
Pl, the probability of correctly identifying a lens as a lens 
and Pd, the probability of correctly identifying a dud as a 
dud. This means that one can get the same value of Skill 
for different combinations of Pl and Pd (see the left panel 
of Figure Al). The skill reflects the all-round ability of a 
classifier to contribute information. 

As described in Paper I, the posterior probability P of a 
subject is determined by the Pl and Pd of all the volunteers 
who clicked on the subject, via Bayes’ Theorem. Each agent 
will apply a “kick” of a different size to the subject proba¬ 
bility, A log P, which can be either positive (if the classifier 
thinks the subject contains a lens) or negative (if the classi¬ 
fier thinks the subject does not contain a lens). For instance, 
given a subject containing a lens, a volunteer with high Pl 
implies a large positive kick irrespective of the value of Pd, 
as shown in the middle panel of Figure Al. However, large 
positive kicks are still possible for a volunteer located in the 
upper triangle with different combinations of (Pl, Pd) sug¬ 
gesting that the kick is not a simple function of (Pl, Pd)- 


The kicks appear as steps on the subject’s trajectory 
plot. This kick magnitude gives a useful measure of an 
agent’s “Power” to move images closer to detection. Note 
that a volunteer who is very good at rejecting duds, but not 
so good at identifying lenses, may have a high Skill but a low 
Power (since they may fail to detect many of the interest¬ 
ing lenses): Power provides a more precise quantification of 
a classifier’s ability to detect lenses (compared to rejecting 
non-lenses). 
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